Added kl_divergence for multivariate normals #1654
Conversation
Force-pushed from 4d999ca to 304cab7.
Force-pushed from 304cab7 to 4e3beae.
making the linter tests happy
Thanks for the feedback @fehiepsi. I have modified the code according to your suggestions and also added a test case where the batch dimensions of the two distributions are not identical (and fixed the linter complaints). If everything is good now, I'd still rebase onto the recent master and merge the commits.
LGTM, thanks @lumip! I have a few small comments on the assertions.
numpyro/distributions/kl.py (Outdated)

```python
Lq_inv = solve_triangular(q_scale_tril, jnp.eye(D), lower=True)
q_half_log_det = jnp.log(jnp.diagonal(q.scale_tril, axis1=-2, axis2=-1)).sum(-1)
assert q_half_log_det.shape == q.batch_shape
```
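(For context: `q_half_log_det` uses the standard Cholesky identity. With $\Sigma_q = L_q L_q^\top$ and $L_q$ lower-triangular,

$$\log\det\Sigma_q = 2\sum_i \log\,(L_q)_{ii},$$

so summing the log of the diagonal of `scale_tril` gives half the log-determinant.)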
nit: this assertion might not be true. In the MultivariateNormal implementation, we avoid unnecessary broadcasting (e.g. we can have a batch of means with a single scale_tril).
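A minimal sketch of the failure mode described above, assuming NumPyro stores `scale_tril` without broadcasting it up to the full batch shape:

```python
import jax.numpy as jnp
import numpyro.distributions as dist

# A batch of 5 means sharing a single scale_tril: batch_shape is (5,),
# but the stored scale_tril keeps its own (unbroadcast) leading shape.
q = dist.MultivariateNormal(loc=jnp.zeros((5, 3)), scale_tril=jnp.eye(3))
print(q.batch_shape)  # (5,)

q_half_log_det = jnp.log(jnp.diagonal(q.scale_tril, axis1=-2, axis2=-1)).sum(-1)
print(q_half_log_det.shape)  # not (5,), so the assertion would fail
```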
Right, I wasn't thinking of that. Removed the assertion and added tests for those cases.
numpyro/distributions/kl.py (Outdated)

```python
    f" {p.event_shape} and {q.event_shape} for p and q, respectively."
)

if p.batch_shape != q.batch_shape:
    min_batch_ndim = min(len(p.batch_shape), len(q.batch_shape))
    if p.batch_shape[-min_batch_ndim:] != q.batch_shape[-min_batch_ndim:]:
```
how about only asserting that p.batch_shape and q.batch_shape can be broadcast?

```python
try:
    result_batch_shape = jnp.broadcast_shapes(p.batch_shape, q.batch_shape)
except ValueError:
    raise ...
```
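For reference, `jnp.broadcast_shapes` follows NumPy's broadcasting rules and raises `ValueError` on incompatible shapes; a quick illustration:

```python
import jax.numpy as jnp

# Singleton and missing leading dimensions broadcast as in NumPy.
print(jnp.broadcast_shapes((5, 1, 3), (2, 3)))  # (5, 2, 3)

# Mismatched non-singleton dimensions raise ValueError.
try:
    jnp.broadcast_shapes((5, 2, 3), (4, 3))
except ValueError as err:
    print("incompatible:", err)
```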
done
numpyro/distributions/kl.py (Outdated)

```python
assert jnp.ndim(q_mean) == 1
assert jnp.ndim(p_scale_tril) == 2
assert jnp.ndim(q_scale_tril) == 2
assert q.mean.shape == q.batch_shape + q.event_shape
```
those assertions are unnecessary.
removed
numpyro/distributions/kl.py (Outdated)

```python
return .5 * (tr + t1 - D - log_det_ratio)
tr = _batch_trace_from_cholesky(Lq_inv @ p.scale_tril)
assert tr.shape == result_batch_shape
```
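Putting the surrounding terms together, the return line is the standard closed-form KL divergence between multivariate normals $p = \mathcal{N}(\mu_p, \Sigma_p)$ and $q = \mathcal{N}(\mu_q, \Sigma_q)$ with event dimension $D$:

$$\mathrm{KL}(p \parallel q) = \frac{1}{2}\left(\operatorname{tr}\!\left(\Sigma_q^{-1}\Sigma_p\right) + (\mu_p - \mu_q)^\top \Sigma_q^{-1} (\mu_p - \mu_q) - D - \log\frac{\det\Sigma_p}{\det\Sigma_q}\right)$$

Here `tr` is $\operatorname{tr}(\Sigma_q^{-1}\Sigma_p) = \lVert L_q^{-1} L_p \rVert_F^2$ (presumably what `_batch_trace_from_cholesky` extracts from `Lq_inv @ p.scale_tril`), `t1` is the Mahalanobis term $\lVert L_q^{-1}(\mu_p - \mu_q)\rVert^2$, and `log_det_ratio` is $\log\det\Sigma_p - \log\det\Sigma_q$.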
this assertion might not be true.
fixed
numpyro/distributions/kl.py (Outdated)

```python
p_mean_flat = jnp.reshape(p.mean, (-1, D))
p_scale_tril_flat = jnp.reshape(p.scale_tril, (-1, D, D))
t1 = jnp.square(Lq_inv @ (p.loc - q.loc)[..., jnp.newaxis]).sum((-2, -1))
assert t1.shape == result_batch_shape
```
this assertion might not be true
fixed
numpyro/distributions/kl.py (Outdated)

```python
).sum(-1)
log_det_ratio = 2 * (p_half_log_det - q_half_log_det)
p_half_log_det = jnp.log(jnp.diagonal(p.scale_tril, axis1=-2, axis2=-1)).sum(-1)
assert p_half_log_det.shape == p.batch_shape
```
this assertion might not be true
removed
test/test_distributions.py (Outdated)

```python
((), ()),
((1,), (1,)),
((2, 3), (2, 3)),
((5, 2, 3), (2, 3)),
```
could you change this to `(5, 1, 3)` and `(2, 3)`?
done
test/test_distributions.py
Outdated
((1,), (1,)), | ||
((2, 3), (2, 3)), | ||
((5, 2, 3), (2, 3)), | ||
((2, 3), (5, 2, 3)), |
nit: maybe `((1, 3), (5, 2, 3))`?
done
Thanks, @lumip!
KL implementation for MultivariateNormal.
EDIT: just saw that there is already a pull request for this from a year ago (#1487). What's the holdup with that?
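Assuming the dispatch registered by this PR, the new rule should be reachable through `kl_divergence` on a pair of `MultivariateNormal`s; a quick Monte Carlo sanity check:

```python
import jax
import jax.numpy as jnp
import numpyro.distributions as dist
from numpyro.distributions import kl_divergence

# Two small multivariate normals with different means and scales.
p = dist.MultivariateNormal(jnp.zeros(3), scale_tril=jnp.eye(3))
q = dist.MultivariateNormal(jnp.ones(3), scale_tril=2.0 * jnp.eye(3))

analytic = kl_divergence(p, q)

# Monte Carlo estimate: KL(p || q) = E_p[log p(x) - log q(x)].
x = p.sample(jax.random.PRNGKey(0), (100_000,))
mc_estimate = jnp.mean(p.log_prob(x) - q.log_prob(x))

print(analytic, mc_estimate)  # should agree to within MC error
```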